Just-In-Time Data Distribution for Analytical Query Processing
نویسندگان
چکیده
Distributed processing commonly requires data spread across machines using a priori static or hash-based data allocation. In this paper, we explore an alternative approach that starts from a master node in control of the complete database, and a variable number of worker nodes for delegated query processing. Data is shipped just-in-time to the worker nodes using a need to know policy, and is being reused, if possible, in subsequent queries. A bidding mechanism among the workers yields a scheduling with the most efficient reuse of previously shipped data, minimizing the data transfer costs. Just-in-time data shipment allows our system to benefit from locally available idle resources to boost overall performance. The system is maintenance-free and allocation is fully transparent to users. Our experiments show that the proposed adaptive distributed architecture is a viable and flexible alternative for small scale MapReduce-type of settings.
منابع مشابه
انتخاب مناسبترین زبان پرسوجو برای استفاده از فراپیوندها جهت استخراج دادهها در حالت دیتالوگ در سامانه پایگاه داده استنتاجی DES
Deductive Database systems are designed based on a logical data model. Data (as opposed to Relational Databases Management System (RDBMS) in which data stored in tables) are saved as facts in a Deductive Database system. Datalog Educational System (DES) is a Deductive Database system that Datalog mode is the default mode in this system. It can extract data to use outer joins with three query la...
متن کاملEfficient Algorithms for Just-In-Time Scheduling on a Batch Processing Machine
Just-in-time scheduling problem on a single batch processing machine is investigated in this research. Batch processing machines can process more than one job simultaneously and are widely used in semi-conductor industries. Due to the requirements of just-in-time strategy, minimization of total earliness and tardiness penalties is considered as the criterion. It is an acceptable criterion for b...
متن کاملApproximate Query Processing in Decision Support System Environment
Both the approximate query process and decisional portals are emerging technologies in the decision support system environment. The former tool provides fast execution time for the analysis applications which require access to large amounts of data in the warehouse, by furnishing estimates of summary data with an approximation error acceptable for decision-maker users. The web-based second tool...
متن کاملRelaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last
In-memory database management systems (DBMSs) are a key component of modern on-line analytic processing (OLAP) applications, since they provide low-latency access to large volumes of data. Because disk accesses are no longer the principle bottleneck in such systems, the focus in designing query execution engines has shifted to optimizing CPU performance. Recent systems have revived an older tec...
متن کاملQuantitative Comparison of Analytical solution and Finite Element Method for investigation of Near-Infrared Light Propagation in Brain Tissue Model
Introduction: Functional Near-Infrared Spectroscopy (fNIRS) is an imaging method in which light source and detector are installed on the head; consequently, re-emission of light from human skin contains information about cerebral hemodynamic alteration. The spatial probability distribution profile of photons penetrating tissue at a source spot, scattering into the tissue, and being released at ...
متن کامل